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FormCalc is a Mathematica package for the automatic computation of tree-level and one-loop Feynman am- 
plitudes. It accepts diagrams generated by FeynArts, simplifies them, and generates a complete Fortran code for 
their numerical evaluation. Version 4 includes new features which enhance performance, convenience of use, and 
modularity /code reusability. 



1. Introduction 

FormCalc [[J is a Mathematica package for the 
calculation of Feynman amplitudes. It prepares 
the amphtudes generated by FeynArts [[2] for nu- 
merical evaluation. This consists of an algebraic 
simplification step, including e.g. the tensor re- 
duction and the calculation of fcrmionic traces, 
and a code generation step, where the complete 
Fortran code for the evaluation of the squared ma- 
trix element is written out. Currently, diagrams 
up to one loop can be simplified, and kinemat- 
ics are supplied for 1 -^ 2, 2 ^ 2, and 2 — > 3 
processes. 

The present article describes the new features 
added in version 4. They can be classified into 

Performance features: 

- Weyl-van dcr Waerden (WvdW) formalism, 

- phase-space integration by the Cuba library, 

- parallelization by shell script. 

Convenience features: 

- log-file management, 

- a simple way to resume aborted calculations, 

- shell scripts to perform common tasks. 

Modularity /reusability features: 

- general-purpose utilities library, 

- organization into master- and sub-makefiles, 

- symbol prefixing to avoid name conflicts. 

Furthermore, the new Feynlnstall script greatly 
simplifies the installation or upgrade of the Feyn- 
Arts, FormCalc, and LoopTools packages. 



2. Weyl van der Waerden formalism 

Amplitudes involving external fermions have 
the form 



X = ^ c, F, , 
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where the Fi are (products of) fermion chains. 
The textbook recipe is to compute probabilities, 
such as 
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and evaluate the F*Fj by standard trace tech- 
niques. 

The problem with this approach is that instead 
of n of the Fi one needs to compute n^ of the 
F*Fj. Since essentially n ~ (number of vectors)!, 
this quickly becomes a limiting factor in problems 
involving many vectors, e.g. in multi-particle final 
states or polarization effects. 

The solution is of course to compute the am- 
plitude A4 directly and this is done most conve- 
niently in the WvdW formalism [ |S] . The im- 
plementation of this technique in an automated 
program has already been outlined in [^ and is 
now tested and available in FormCalc 4. 

The FermionChains option of CalcFeynAmp 
determines how fermion chains are returned: 
Weyl, the default, selects Weyl chains. Chiral 
and VA select Dirac chains in the chiral (a;+/w_) 
and vector/axial- vector (I/75) decomposition, re- 
spectively. The Weyl chains do not need to be 



further evaluated with HelicityME, which apphes 
the trace technique. 

The WvdW method has other advantages, too: 
Polarization does not 'cost' extra in terms of CPU 
time, that is, one gets the spin physics for free. 
Whereas with the trace technique the formulas 
become significantly more bloated when polariza- 
tion is taken into account, in the WvdW formal- 
ism one actually needs to sum up the polarized 
amplitudes to get the unpolarized result. 

There is also better numerical stability because 
components of fc'' are arranged as 'large' and 
'small' matrix entries, viz. 
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Cancellations of the form ko — k^ = \/W + rr^ — 
vfc2 for 7TT. ^ fc are avoided and hence mass effects 
are treated more accurately. 

3. Phase-space integration 

The recently completed Cuba library [E] has 
been integrated into FormCalc 4. It provides 
four subroutines for multidimensional numerical 
integration. All four have a very similar invoca- 
tion and can thus be interchanged easily, e.g. for 
comparison. The flexibility of a general-purpose 
method is particularly useful in the setting of au- 
tomatically generated code. 

The following algorithms are included: 

Vegas is the classic Monte Carlo algorithm 
which uses importance sampling for variance re- 
duction. It iteratively builds up a piecewise con- 
stant weight function, represented on a rectan- 
gular grid. Each iteration consists of a sampling 
step followed by a refinement of the grid. The 
present implementation uses Sobol quasi-random 
numbers for sampling. 

Suave is a crossover between Vegas and Miser 
and combines Vegas-style importance sampling 
with globally adaptive subdivision: Until the re- 
quested accuracy is reached, the region with the 
largest error is bisected along the axis in which 
the fluctuations of the integrand are reduced 
most. In each half the number of new samples 
is prorated for the fluctuation. 



Divonne is a further development of the Cern- 
LiB routine D151. It is intrinsically a Monte 
Carlo algorithm but has cubature rules built in for 
comparison, too. The variance-reduction method 
is stratified sampling. In a first step, a tessellation 
of the integration region is constructed in which 
all subregions have an approximately equal value 
of the spread, defined as 

s{r) = — - — (max/(f ) -min/(f )). (4) 

Minimum and maximum here are sought using 
methods from numerical optimization. The sub- 
regions are then sampled independently with a 
number of points extrapolated to reach the re- 
quired accuracy. For each region, the latterly ob- 
tained value is compared to the initial rough es- 
timate and if the two are not compatible within 
their errors, the region is subdivided or sampled 
once more. Additions with respect to the Cern- 
LiB version are the final comparison phase and 
the possibility to point out known extrema, to 
speed up convergence. 

Cuhre is a new implementation of dcuhre. It 
is a deterministic algorithm which employs cu- 
bature rules of a polynomial degree. Variance re- 
duction is by globally adaptive subdivision: Until 
the requested accuracy is reached, bisect the re- 
gion with the largest error along the axis with the 
largest fourth difference. 

Fig. m compares the performance of the four 
algorithms for a real phase-space integration of 
the process e^e^ — > tt^. Above all it is very im- 
portant to have several independent integration 
methods to cross-check the results. 

4. Parallelization 

Calculations in models like the MSSM, where 
not all input parameters are yet known, often re- 
quire extensive scans to cover an interesting part 
of the parameter space. Such a scan can be a real 
CPU hog, but on the other hand, the calculation 
can be performed completely independently for 
each parameter set and is thus an ideal candidate 
for parallelization. The real question is thus not 
how to parallelize the calculation, but how to au- 
tomate the parallelization. 
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Figure 1. The number of integrand evaluations 
for the phase-space integration of e^e'^ -^ it"f at 
a requested relative accuracy of 10^"^. 



In FormCalc, the user may specify parameter 
loops by defining preprocessor variables, e.g. 

#define LOOPl do 1 TB = 2, 30 

These definitions are substituted at compile time 
into a main loop of the form 

LDOPl 
L00P2 



calculate cross-section 
1 continue 

The obstacle to automatic parallelization is that 
the loops are user-defined and in general nested. 
A serial number is introduced to unroll the loops: 

s = 
LOOPl 
L00P2 

s = s + 1 

if ( s not in allowed range ) goto 1 
calculate cross-section 
1 continue 

The serial number range can be specified on the 
command line so that it is quite straightforward 



to distribute patches of serial numbers on differ- 
ent machines. Most easily this is done in an in- 
terleaved manner, since one then does not need 
to know to which upper limit the serial number 
runs, i.e. if there are A^ machines available, send 
serial numbers 1, A + 1, 2N + 1, etc. on machine 
1, send serial numbers 2, A^ -I- 2, 2A^ -|- 2, etc. on 
machine 2, . . . 

This procedure is completely automated in 
FormCalc 4: The user once creates a . submitrc 
file in his home directory and lists there all ma- 
chines that may be used, one on each line. In 
the case of multi-processor machines he puts the 
number of processors after the host name. 

The executable compiled from FormCalc code, 
typically called run, is then simply prefixed with 
submit. For instance, instead of 

run uuuu 500,1000 

the user invokes 

submit run uuuu 500,1000 

The submit script uses ruptime to determine the 
load of the machines and ssh to log in. Handling 
of the serial number is invisible to the user. 

5. Log- file management and Resume 

Due to the parallelization mechanism, a single 
output file is no longer sufficient. Instead of a 
single log file, FormCalc 4 creates a log directory, 
opens one log file for each serial number in this 
directory, and redirects console output to this file. 

Each log file contains both the 'real' data and 
the 'chatter' (progress, warning, and error mes- 
sages). This has the advantage that no unit num- 
bers must be passed between subroutines - every 
bit of output is simply written to the console (unit 
* in Fortran). It also makes it easier to pinpoint 
errors, since the error message appears right next 
to the corrupted data. The 'real' data are marked 
by an @ in column 1 and there exists a simple 
shell script, data, to extract the real data from 
the log file. 

The new log-file management also provides an 
easy way to resume an aborted calculation. This 
works as follows: when running through the loops 
of a parameter scan, the log file for a particular 



serial number 

- may not exist: 

then it is created with execute permissions, 

- may exist, but have execute permissions: 
then it is overwritten, 

- may exist and have read- write permissions: 
then this serial number is skipped. 

The execute permissions, which serve here merely 
as a flag to indicate an ongoing calculation, are 
reduced to ordinary read-write permissions when 
the log file is closed. 

In other words, the program skips over the 
parts of the calculation that are already finished, 
so all the user has to do to resume an aborted 
calculation is start the program again with the 
same parameters. 

6. Shell scripts 

FormCalc 4 includes a few useful shell scripts: 

sfx packs all source files (but not object, ex- 
ecutable, or log files) in the directory it is in- 
voked in into a mail-safe self-extracting archive. 
For example, if sfx is invoked in the directory 
myprocess, it produces myprocess . sfx. This file 
can e.g. be mailed to a collaborator, who needs to 
say "myprocess . sfx x" to unpack the contents. 

pnuglot produces a high-quality plot in En- 
capsulated PostScript format from a data file 
in just one line. In fact, pnuglot does not 
even make the plot itself, it writes out a shell 
script to do that, thus "pnuglot mydata" cre- 
ates mydata . gpl which then runs gnuplot, I^TJt;X, 
and dvips to create mydata.eps. The advantage 
of this indirect method is that the default gnu- 
plot commands in mydata. gpl can subsequently 
be edited to suit the user's taste. Adding a label 
or choosing a different line width is, for example, 
a pretty trivial matter. Needless to say, all la- 
bels are in M^t;X and Type 1 fonts are selected to 
make the EPS file nicely scalable. 

turnof f switches off (and on) the evaluation of 
certain parts of the amplitude, which is a handy 
thing for testing. For example, "turnof f box" 
switches off all parts of the amplitude with 'box' 
in their name. Invoking turnof f without any ar- 
gument restores all modules. 



7. Libraries and Makefiles 

The Fortran code is organized in FormCalc 4 
into a main code directory, which contains the 
main program and all its prerequisite files, and 
subsidiary 'folders' (subdirectories to the main 
code directory). The default setup looks like this: 

main code directory 
(created by SetupCodeDir) 



u 



squared_me/ 

(generated by WriteSquaredME) 

renconst/ 

(generated by WriteRenConst) 

util/ 

(comes witli FormCalc) 



Each folder is equipped with its own makefile 
which makes a library of the same name, e.g. the 
makefile in util/ makes the library util. a. 

These sub-makefiles are orchestrated by the 
master makefile. Libraries required for the main 
program are listed in the LIBS variable and built 
automatically by invoking the sub-makefiles: 

LIBS = squared_me.a renconst. a util. a 

The master makefile is no longer overwritten in 
the code-generation process and is treated like the 
other driver programs, i.e. a customized copy can 
be saved in the local drivers directory. 

Occasionally it is useful to have more than 
one instance of squared_me (or renconst), e.g. 
when computing an hadronic cross-section to 
which several partonic processes contribute. Both 
WriteSquaredME and WriteRenConst have the 
Folder option, with which a unique folder name 
can be chosen, and the SymbolPrefix option, 
with which the symbols visible to the linker can 
be prefixed with a unique identifier. This identi- 
fier is inserted only at compile time and can easily 
be changed in the sub-makefile at any time. 

The util library is a collection of ancillary rou- 
tines which currently includes: 

- System utilities (log file management), 

- Kinematic functions (Pair, Eps, . . . ), 

- Diagonalization routines (Eigenvalues, . . . ), 

- Univariate integrators (Gauss, Patterson), 

- Multivariate integrators (Cuba library). 



Older versions of FormCalc used to include these 
directly into the main program. The library ver- 
sion has the advantage that the linker selects only 
the routines that are actually needed, and fur- 
thermore it is straightforward to add new code. 
The util . a library is compiled once when Form- 
Calc is installed and then copied to the main code 
directory, thus avoiding unnecessary compiles. 

8. Summary 

FormCalc 4 is the latest release of the Math- 
ematica package FormCalc for computing Feyn- 
man diagrams. It has many new features enhanc- 
ing in particular performance, convenience, and 
modularity. 

The package is available as open source and 
stands under the GNU library general public li- 
cense (LGPL). It can be obtained from the Web 
site http : //www . f eynarts . de/f ormcalc. 

Installation should be particularly painless 
with the new Feynlnstall script. The user only 
has to answer 'y' when the script asks whether to 
install FormCalc. 
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